8 research outputs found

    Capture, Learning, and Synthesis of 3D Speaking Styles

    Audio-driven 3D facial animation has been widely explored, but achieving realistic, human-like performance is still unsolved. This is due to the lack of available 3D datasets, models, and standard evaluation metrics. To address this, we introduce a unique 4D face dataset with about 29 minutes of 4D scans captured at 60 fps and synchronized audio from 12 speakers. We then train a neural network on our dataset that factors identity from facial motion. The learned model, VOCA (Voice Operated Character Animation), takes any speech signal as input - even speech in languages other than English - and realistically animates a wide range of adult faces. Conditioning on subject labels during training allows the model to learn a variety of realistic speaking styles. VOCA also provides animator controls to alter speaking style, identity-dependent facial shape, and pose (i.e. head, jaw, and eyeball rotations) during animation. To our knowledge, VOCA is the only realistic 3D facial animation model that is readily applicable to unseen subjects without retargeting. This makes VOCA suitable for tasks like in-game video, virtual reality avatars, or any scenario in which the speaker, speech, or language is not known in advance. We make the dataset and model available for research purposes at http://voca.is.tue.mpg.de.
    Comment: To appear in CVPR 2019
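
    The abstract describes a model that maps a speech signal, conditioned on a subject label, to facial motion. As a rough illustration (not VOCA's actual architecture), the sketch below shows how one-hot style conditioning of per-frame speech features might drive per-vertex mesh offsets; the dimensions, the animate function, and the random stub decoder are all hypothetical stand-ins.

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    NUM_SUBJECTS = 8      # training identities used as style conditions (assumed)
    NUM_VERTICES = 5023   # vertex count of the template face mesh (assumed)
    FEAT_DIM = 29         # per-frame speech feature dimension (assumed)

    # Stand-in for a trained decoder: one linear map from the conditioned
    # feature vector to per-vertex 3D displacements over the template mesh.
    W = rng.normal(0.0, 1e-3, size=(FEAT_DIM + NUM_SUBJECTS, NUM_VERTICES * 3))

    def animate(speech_features: np.ndarray, subject_id: int) -> np.ndarray:
        """speech_features: (T, FEAT_DIM), one row per animation frame.
        Returns (T, NUM_VERTICES, 3) vertex offsets over the template mesh."""
        style = np.eye(NUM_SUBJECTS)[subject_id]   # one-hot speaking-style code
        frames = []
        for feat in speech_features:
            x = np.concatenate([feat, style])      # condition on the subject label
            frames.append((x @ W).reshape(NUM_VERTICES, 3))
        return np.stack(frames)

    offsets = animate(rng.normal(size=(60, FEAT_DIM)), subject_id=3)  # 1 s at 60 fps
    ```

    Because the style code is an input rather than baked into the weights, the same speech can be replayed with any learned speaking style, which is the kind of animator control the abstract refers to.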

    Figure-ground modulation in awake primate thalamus

    Figure-ground discrimination refers to the perception of an object, the figure, against a nondescript background. Neural mechanisms of figure-ground detection have been associated with feedback interactions between higher centers and primary visual cortex and have been held to index the effect of global analysis on local feature encoding. Here, in recordings from visual thalamus of alert primates, we demonstrate a robust enhancement of neuronal firing when the figure, as opposed to the ground, component of a motion-defined figure-ground stimulus is located over the receptive field. In this paradigm, visual stimulation of the receptive field and its near environs is identical across both conditions, suggesting the response enhancement reflects higher integrative mechanisms. It thus appears that cortical activity generating the higher-order percept of the figure is simultaneously reentered into the lowest level that is anatomically possible (the thalamus), so that the signature of the evolving representation of the figure is imprinted on the input driving it in an iterative process.
    Funding: Biotechnology and Biological Sciences Research Council (United Kingdom), G022305/1; Medical Research Council (United Kingdom), G070153

    Animation Synthesis Triggered by Vocal Mimics

    We propose a method that leverages the naturally time-related expressivity of the voice to control an animation composed of a set of short events. Users record themselves mimicking onomatopoeia sounds such as "Tick", "Pop", or "Chhh", each of which is associated with a specific animation event. The recorded soundtrack is automatically analyzed to extract the time and type of every sound. We then synthesize an animation in which each event's type and timing match the soundtrack. Beyond being a natural way to control animation timing, we demonstrate that multiple stories can be generated efficiently by recording different voice sequences, and that using more than one soundtrack allows different characters to be controlled with overlapping actions.
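
    The analysis step described here (detecting when each mimicked sound occurs and which type it is) can be illustrated with a toy pipeline. The sketch below uses a simple short-time-energy onset detector and a zero-crossing-rate classifier as placeholders; neither is the paper's actual method, and all names and thresholds are assumptions.

    ```python
    import numpy as np

    def detect_events(signal: np.ndarray, sr: int, hop: int = 512,
                      threshold: float = 0.02) -> list[tuple[float, str]]:
        """Return (time_in_seconds, event_type) pairs for each detected sound."""
        # Short-time energy of hop-sized frames.
        frames = signal[: len(signal) // hop * hop].reshape(-1, hop)
        energy = (frames ** 2).mean(axis=1)
        # An onset is a frame whose energy rises above the threshold.
        rising = (energy[1:] > threshold) & (energy[:-1] <= threshold)
        return [((i + 1) * hop / sr, classify(frames[i + 1]))
                for i in np.flatnonzero(rising)]

    def classify(frame: np.ndarray) -> str:
        """Toy stand-in for onomatopoeia recognition: sustained noisy bursts
        ("Chhh") have a higher zero-crossing rate than impulsive ones ("Pop")."""
        zcr = np.mean(np.abs(np.diff(np.sign(frame)))) / 2
        return "chhh" if zcr > 0.3 else "pop"

    # Example: a 1 s recording at 16 kHz with a decaying "Pop" tone and a
    # noisy "Chhh" burst; each detected event could trigger an animation clip.
    sr, t = 16000, np.arange(16000)
    signal = np.zeros(sr)
    signal[4000:4200] = np.sin(t[:200] * 0.5) * np.exp(-t[:200] / 100)   # "Pop"
    signal[9600:12000] = np.random.default_rng(1).normal(0, 0.2, 2400)   # "Chhh"
    events = detect_events(signal, sr)   # -> [(0.224, 'pop'), (0.608, 'chhh')]
    ```

    An animation system would then replay, for each (time, type) pair, the short clip bound to that event type, which is what lets different recordings produce different stories from the same clip library.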

    Nitric Oxide and Synaptic Dynamics in the Adult Brain: Physiopathological Aspects
